AUC Maximization for Low-Resource Named Entity Recognition

نویسندگان

چکیده

Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize underlying NER model. Both of these traditional objective for problem generally produce adequate performance when data distribution is balanced and there are sufficient annotated training examples. But since inherently an imbalanced tagging problem, model under low-resource settings could suffer using standard functions. Based on recent advances area ROC curve (AUC) maximization, we propose by maximizing AUC score. We give evidence that simply combining two binary-classifiers maximize score, significant improvement over loss achieved settings. also conduct extensive experiments demonstrate advantages our method highly-imbalanced To best knowledge, this first brings maximization setting. Furthermore, show agnostic different types embeddings, models domains. The code available at https://github.com/dngu0061/NER-AUC-2T.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Using Wikipedia as a Resource for Arabic Named Entity Recognition

In this paper we describe a novel approach to delimit named entities and nominal successive mentions in the Arabic Wikipedia. To achieve this a linguistic analysis of named entities and successive mentions in the Arabic Wikipedia, in terms of coverage and complexity, is presented. A supervised machine learning classifier has been used to predict the presence of the named entities in the Arabic ...

متن کامل

A Golden Resource for Named Entity Recognition in Portuguese

This paper presents a collection of texts manually annotated with named entities in context, which was used for HAREM, the first evaluation contest for named entity recognizers for Portuguese. We discuss the options taken and the originality of our approach compared with previous evaluation initiatives in the area. We document the choice of categories, their quantitative weight in the overall c...

متن کامل

Named Entity Recognition for Ukrainian: A Resource-Light Approach

Named entity recognition (NER) is a subtask of information extraction (IE) which can be used further on for different purposes. In this paper, we discuss named entity recognition for Ukrainian language, which is a Slavonic language with a rich morphology. The approach we follow uses a restricted number of features. We show that it is feasible to boost performance by considering several heuristi...

متن کامل

Phonologically Aware Neural Model for Named Entity Recognition in Low Resource Transfer Settings

Named Entity Recognition is a well established information extraction task with many state of the art systems existing for a variety of languages. Most systems rely on language specific resources, large annotated corpora, gazetteers and feature engineering to perform well monolingually. In this paper, we introduce an attentional neural model which only uses language universal phonological chara...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i11.26571